Perceptual Loss, Neural Distance, Embedding Similarity, Content-aware Metrics
CLIP Model Overview : Unlocking the Power of Multimodal AI
towardsdatascience.com·1d
A Training-Free, Task-Agnostic Framework for Enhancing MLLM Performance on High-Resolution Images
arxiv.org·16h
Embedding Space Allocation with Angle-Norm Joint Classifiers for Few-Shot Class-Incremental Learning
arxiv.org·1d
Cross-modal Associations in Vision and Language Models: Revisiting the bouba-kiki effect
arxiv.org·16h
L-CLIPScore: a Lightweight Embedding-based Captioning Metric for Evaluating and Training
arxiv.org·1d
Calibrated and Robust Foundation Models for Vision-Language and Medical Image Tasks Under Distribution Shift
arxiv.org·16h
Accuracy Is Dead: Calibration, Discrimination, and Other Metrics You Actually Need
towardsdatascience.com·19h
Loading...Loading more...